Description

Circuit arrangement and signal-processing device

The invention relates to a circuit arrangement and to a
signal-processing apparatus.

For automatic speech recognition purposes, it is known
for a Fast Fourier Transform (FFT) to be applied to a
digitized input speech signal, for spectral analysis of
the input speech signal. Features which are used for
automatic speech recognition purposes are derived from
the power spectrum formed by means of the Fast Fourier
Transform (see [1]).

In a Fast Fourier Transform such as this, a time window
of predetermined duration is normally used, with one
signal element which is represented by the respective
time window in each case being subjected to a Fast
Fourier Transform. This leads to restricted frequency
resolution and time resolution.

If, as is normal for speech recognition, only the power
spectrum and thus the magnitude spectrum of the
respective signal element is used, the time resolution
is limited by the duration of the time window that is
used. This limitation on the time resolution is one
factor which restricts the performance of already known
speech-recognition systems. One problem with sound
processing systems and the use of a time window of a
fixed predetermined size such as this is that, if the
power spectrum varies, an error arises after
reverse-transformation to the time domain, because of
the finite nature of the time window.

According to alternative approaches for automatic
speech recognition, as described in [2], [3] and [4],
filter banks are used. These filter banks are intended
to simulate the characteristics of the human inner
ear.

Furthermore, [5] describes a model of the inner hair
cells (IHC) and of the auditory nerves, with the
basilar membrane being simulated using different
filters and a compression unit, as well as an
amplification unit. Furthermore, [5] describes a
vesicle pool model.

Furthermore, [6] describes a model of the signal
processing in human hearing, with an acoustic sound
signal being used as an input signal according to this
model. The input signal is subdivided into a plurality
of frequency channels, analogously to the
frequency-location transformation in the inner ear,
with half-wave rectification as well as low-pass
filtering being provided for each channel, and with an
adaptation being provided for amplification of sudden
changes in the input signal, and for attenuation of
components of the input signal which are essentially
constant over time.

[7] describes the phenomenon which is referred to as
"recruitment", as well as the so-called dynamic
compression used to compensate for the recruitment
phenomenon. The dynamic compression results in the wide
range of sound levels which occurs in the acoustic
environment being "compressed" to a range which can be
perceived by human beings.

[8] describes a digital simulation of a one-dimensional
long-wave model of the basilar membrane of a human
being, using a wave digital filter structure.

[9] describes a growth function, in which the
oscillation of the basilar membrane is described with
respect to the sound pressure, measured upstream of the
tympanic membrane of an experimental animal.

Furthermore, [10] describes the biological structure of
the inner hair cells.

[11] describes amplifier circuits, in which the
amplifiers are used to improve the intelligibility
of a sound projection system. The voltage-controlled
amplifiers described in [11] are controlled by a gain
control signal which is produced by means of a buffer
amplifier and is formed by a combination network from
the signal elements, as produced by means of a filter
bank, in different frequency channels. The
voltage-controlled amplifiers as described in [11]
process four different signals in four different
frequency channels.

The aim of the circuit arrangement described there is
to change the output signal produced by the respective
voltage-controlled amplifiers to the same level as the
amplitude of the signal in the baseband channel.

The invention is based on the problem of specifying a
circuit arrangement as well as a signal-processing
apparatus for provision of features to describe a
signal which is supplied to the circuit arrangement or
to the signal-processing apparatus, with the features
being more robust with respect to any interference
noise that occurs.

The problem is solved by a circuit arrangement and by a 
signal-processing apparatus having the features as 
claimed in the independent patent claims. 

Preferred refinements of the invention are specified in
the dependent claims.

A circuit arrangement has a filter bank with a
plurality of filter stages and a filter-bank input to
which an input signal can be supplied. Furthermore, the
circuit arrangement has a plurality of resonator
circuits for generation of in each case one output
signal element from the input signal, with each
resonator circuit in each case being associated with at
least one filter stage from the plurality of filter
stages, and being coupled to an output of the
respective filter stage. Each resonator circuit has a
capacitance, an inductance and a resonator output, at
which the respective output signal element can be
produced. Furthermore, at least one resonator control
circuit for open-loop or closed-loop control of the
Q-factor of at least one resonator circuit is provided in
the circuit arrangement, with the resonator control
circuit being designed in such a manner that it
provides open-loop or closed-loop control for the
Q-factor of the at least one resonator circuit as a
function of the time profile of the signal amplitude of
the input signal and/or of the output signal element
from the at least one resonator circuit.

A signal-processing apparatus has a circuit arrangement
as described above as well as a further-processing unit
for further-processing of the signal which is produced
by the circuit arrangement.

The combination of a filter bank, preferably a linear
filter bank, with the resonator circuits, which in
effect form non-linear compression stages, simulates,
according to the invention, the non-linear oscillation
response of the inner ear of mammals very well.

In the course of a feature extraction process on an
input speech signal, features are in effect produced
as the output signal of each resonator circuit. These
features are more robust to interference noise and, in
particular for the purposes of a speech-recognition
system, this leads to an improved word error rate in a
feature extraction system (formed by the circuit
arrangement) provided according to the invention.

In particular, as the interference noise increases, the
word error rate increases more slowly than in the
course of the traditional initial processing using a
Fast Fourier Transform, applied to power spectra which
have been split on a time-window basis.

A further advantage which is achieved by the invention
is that the fine time structure of the recorded speech
signal, or in general of the analog input signal which
is supplied to the circuit arrangement, is largely
maintained, while, in the case of the power spectrum
when using a Fast Fourier Transform, the time
resolution of the features that is achieved is only in
the region of the length of the window that is used.

Thus, according to the invention, the biological
structure and, in particular, the essential
characteristics of the human hearing system are in
effect simulated in a better manner than in the prior
art for the purposes of extraction of features from an
acoustic signal. This leads to a more robust
speech-recognition system.

The invention can thus clearly be seen in the
extraction of features for, for example, automatic
speech recognition, that is to say for an automatic
speech-recognition system. In particular, features are
produced from an input signal which is supplied to the
circuit arrangement, which features allow more
robustness to interference noise than can be achieved
according to the prior art.

Alternatively, the invention can be used highly
advantageously for a hearing aid, for example for a
cochlear implant, particularly in the case of patients
with impaired hearing caused by the inner ear.

According to the invention, the Q-factor of the
respective resonator circuit is adjusted on the basis
of the amplitude of the input signal or output signal
element. If the amplitude of one of these signals is
very high, then the resonator control circuit can be
used to reduce the Q-factor of the at least one
resonator circuit to such an extent that the signal is
highly attenuated. In contrast, in the case of a
low-amplitude signal, the Q-factor can be increased to
such an extent that the signal amplitude at the output
of the respective resonator circuit is amplified.

In effect, according to the invention, in order to
carry out dynamic compression, use is made of the fact
that a resonator circuit acts as a stable amplifier when
close to its resonant frequency (resonance peak).

The Q-factor of the resonator circuit used here is
understood to mean the ratio of the amplitude of the
output signal at the resonant frequency of the
resonator circuit to the corresponding amplitude of the
input signal. The Q-factor of a resonator circuit
depends on its resistance, so that the Q-factor can be
adjusted, for example, by means of open-loop or
closed-loop control of the resistance of the resonator
circuit.
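
The definition above can be checked numerically. The following is a minimal sketch, assuming a series RLC circuit with the output taken across the capacitor (the topology described later in this description) and purely illustrative component values that do not come from the text. For such a circuit the gain at resonance equals Q = sqrt(L/C)/R, so a larger resistance gives a lower Q-factor.

```python
import math

def series_rlc_q(r_ohm, l_henry, c_farad):
    """Q-factor of a series RLC circuit: Q = sqrt(L/C) / R."""
    return math.sqrt(l_henry / c_farad) / r_ohm

def gain_across_capacitor(freq_hz, r_ohm, l_henry, c_farad):
    """|U_out / U_in| with the output taken across the capacitor."""
    w = 2.0 * math.pi * freq_hz
    # Voltage divider: H(jw) = 1 / (1 - w^2 L C + j w R C)
    h = 1.0 / complex(1.0 - w * w * l_henry * c_farad, w * r_ohm * c_farad)
    return abs(h)

# Illustrative component values (assumptions, not taken from the text).
L_H, C_F = 10e-3, 2.533e-6                      # resonance near 1 kHz
F0 = 1.0 / (2.0 * math.pi * math.sqrt(L_H * C_F))

for r in (0.63, 6.3, 63.0):                     # larger R -> lower Q
    q = series_rlc_q(r, L_H, C_F)
    g = gain_across_capacitor(F0, r, L_H, C_F)
    print(f"R = {r:6.2f} ohm   Q = {q:7.2f}   gain at f0 = {g:7.2f}")
```

The printed Q and the gain at the resonant frequency agree for each resistance, which is the sense in which controlling the resistance controls the amplification near resonance.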

In a scenario in which the Q-factor of the resonator
circuit is adjusted on the basis of the amplitude of
the input signal that is input to the resonator
circuit, the functionality of the control circuit can
be referred to as "open-loop control". If, in contrast,
the Q-factor of the resonator circuit is adjusted on
the basis of the amplitude of the output signal, then
the resonator control circuit carries out a
"closed-loop control" functionality, since it uses
feedback to adapt the Q-factor.

The circuit arrangement according to the invention
allows reliable and effective dynamic compression of an
input signal in the time domain, without the
disadvantages of a Fourier Transform occurring. In
particular, this avoids the problems of a finite time
window that occur with a Fourier Transform according to
the prior art. Furthermore, according to the invention,
a dynamically compressed output signal is generated
which, for example, has considerably less signal
distortion than that obtained by reverse
transformation of the logarithmic Fourier spectrum.

According to the invention, sufficiently powerful and
intensity-selective (for example non-linear)
attenuation of an input signal is made possible by
selectively setting the Q-factor of the resonator
circuit.

In effect, the circuit arrangement has a filter
circuit, with the frequency range for which the
resonator circuit passes signals being defined on the
basis of the value of the inductance L and of the
capacitance C of the resonator circuit. Setting the
values L, C thus provides a simple way to adjust the
center frequency of the pass-band of the resonator
circuit. The width of the resonance curve of the
resonator circuit can, in particular, be adjusted
by adjustment of its Q-factor. The resonator circuit
may be regarded as a filter with non-linear
attenuation, which in principle allows any desired
level of dynamic compression to be achieved. Because
the processing is carried out over a sufficiently
narrow bandwidth, it is also possible to keep the
distortion which can occur as a result of excessive
non-linearity adequately low.

The circuit arrangement may include a second-order
resonator circuit, in which the attenuation rises
non-linearly as the sound level rises. In the case of a
passive implementation of the circuit arrangement, that
is to say when using passive components (coil L,
capacitor C, resistance R), it is possible to produce a
stable circuit (in contrast to systems which require an
active amplifier with feedback).

The resonator circuit may have a resistance which can
be subjected to open-loop control (or closed-loop
control) by means of the resonator control circuit. A
controllable resistance such as this is a simple
circuit element, by means of which open-loop or
closed-loop control of the Q-factor of the resonator
circuit can be carried out with little complexity, as
well as accurately and in a stable manner.

It should be noted that the resonator control circuit
can be formed from a plurality of resonator control
circuit elements, in which case one resonator control
circuit element in each case provides open-loop or
closed-loop control for the respective Q-factor of the
resonator circuit associated with it.

The input signal may be provided between a first
connection of the resistance and a first connection of
the capacitance. The output signal may be produced
between the first connection of the capacitance and a
second connection of the capacitance. A second
connection of the resistance may be coupled to a first
connection of the inductance, and a second connection
of the inductance may be coupled to a second connection
of the capacitance.

The resonator control circuit may be designed in such a
manner that it controls the Q-factor of the at least
one resonator circuit on the basis of a Boltzmann
function, in which the amplitude of the output signal
is included as a parameter. If the parameters contained
in it are chosen appropriately, a Boltzmann function is
highly suitable for approximation of the sensitivity
curve of the outer hair cells in the human inner
ear. A particularly good description of this biological
relationship can be provided by means of a second-order
Boltzmann function. This makes it possible to
approximate the sensitivity curve in the human ear, as
is advantageous for applications of the circuit
arrangement in the medical field (for example for a
hearing aid).
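
The idea of a Boltzmann-based control law can be illustrated as follows. This is only a sketch under stated assumptions: the text does not give the functional form or any parameter values, so the second-order Boltzmann function below uses one commonly quoted form, and the slopes, offsets, the Q range and the mapping from the function value to a Q-factor are all hypothetical choices made for illustration.

```python
import math

def boltzmann2(x, a1=0.1, x1=20.0, a2=0.05, x2=40.0):
    """Second-order Boltzmann function (assumed form), rising from 0 to 1.

    The slopes a1, a2 and offsets x1, x2 are hypothetical values.
    """
    return 1.0 / (1.0 + math.exp(a2 * (x2 - x)) * (1.0 + math.exp(a1 * (x1 - x))))

def q_from_amplitude(amplitude, q_max=10.0, q_min=1.0):
    """Map an output amplitude (arbitrary units) to a Q-factor.

    Assumed mapping: low amplitude gives a Q near q_max,
    high amplitude gives a Q near q_min.
    """
    return q_min + (q_max - q_min) * (1.0 - boltzmann2(abs(amplitude)))

for amp in (0.0, 10.0, 40.0, 100.0, 500.0):
    print(f"amplitude = {amp:6.1f}   Q = {q_from_amplitude(amp):6.3f}")
```

With these assumed parameters the Q-factor falls monotonically and non-linearly as the amplitude grows, and it stays between the two chosen limits.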

The resonator control circuit may be designed in such a
manner that it adjusts the Q-factor of the at least one
resonator circuit as a function of the amplitude of the
output signal, based on a sensitivity characteristic
determined for a human ear. In order to simulate the
sensitivity characteristic in the inner ear of a human
being particularly well by means of a circuit
arrangement according to the invention, a sensitivity
characteristic (which has been determined, for example,
experimentally or theoretically) of the human ear can
be stored in the form of a file or table, such that it
is accessible to the control circuit. In this case, the
resonator control circuit can provide open-loop or
closed-loop control for the Q-factor of the at least
one resonator circuit, in such a manner that the
biological sensitivity characteristic stored in it is
approximated.

The resonator control circuit may be designed in such a
manner that the higher the amplitude of the respective
output signal element from the respective resonator
circuit, the lower it sets the Q-factor of the at least
one resonator circuit.

The resonator control circuit may also be designed in
such a manner that it adjusts the Q-factor of the at
least one resonator circuit as a non-linear function of
the amplitude of the respective output signal element.
That is to say, signal areas of high amplitude are
attenuated to a more than proportional extent in
comparison to signal areas of low amplitude. Thus, even
if there is an extremely wide range of sound levels in
an input signal, it is possible to achieve a compromise
with a sufficiently narrow range in the output signal.

The resonator control circuit can be designed in such a
manner that it adjusts the Q-factor of the at least one
resonator circuit in such a manner that the amplitude
of the respective output signal element is within a
predetermined interval. For specific applications, it
may be advantageous to keep the amplitude of an output
signal element within a predetermined interval in all
cases. By way of example, this may be important for
data compression purposes, when a signal with a high
degree of intensity fluctuation is intended to be
detected with as few quantization levels as possible.
In this case, the resonator control circuit can be
designed in such a manner that it provides open-loop or
closed-loop control for the Q-factor of the resonator
circuit in such a manner that the respective output
signal element is within the predetermined interval.

The circuit arrangement may have a plurality of
series-connected resonator circuits, in which case an
output from a respective upstream resonator circuit can
be produced as the input signal for the respective
resonator circuit connected downstream from it.

In effect, according to this particularly
advantageous refinement, a filter bank with a plurality
of resonator circuits connected in series is provided,
thus making it possible to extend the dynamic
compression to an even wider dynamic range. In
principle, a sufficiently high degree of dynamic
compression (for example 60 dB) can be achieved just
with one filter stage (that is to say with one
resonator circuit) with a very high Q-factor Q (for
example Q = 1000, which is reduced to a Q-factor of
Q = 1 for high levels). A circuit arrangement such as
this has a very narrow bandwidth, however (for example
0.1% of the resonant frequency of the resonator
circuit). According to the invention, a sufficiently
high degree of dynamic compression (for example of
60 dB) can be achieved by cascading a plurality of
filter stages (for example three series-connected
filter stages) with a relatively low Q-factor Q (for
example Q = 10, so that Q^3 = 1000). The individual
Q-factors, which are not as high, of each of these
filters result in the advantageous effect that, because
of the increased bandwidth of the individual filters
which results from the lower Q-factor, the filters
cover a wider frequency range, and the impulse response
of the filters is at the same time improved, that is to
say the transient times of the system, on switching on
and off, are considerably shorter.
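
The arithmetic behind these example figures can be written out as a short sketch. It assumes that each stage contributes a resonance gain equal to its Q-factor and that the 3 dB bandwidth of a single stage is approximately the resonant frequency divided by Q; the function names are chosen here for illustration only.

```python
import math

def compression_db(q_high, q_low=1.0, stages=1):
    """Compression range in dB when each of `stages` identical stages
    swings its resonance gain between q_high and q_low."""
    return 20.0 * stages * math.log10(q_high / q_low)

def relative_bandwidth(q):
    """Approximate 3 dB bandwidth of one stage as a fraction of f0."""
    return 1.0 / q

# One stage with Q = 1000 versus three cascaded stages with Q = 10 each.
for q, n in ((1000.0, 1), (10.0, 3)):
    print(f"{n} stage(s), Q = {q:6.0f}: "
          f"{compression_db(q, 1.0, n):5.1f} dB, "
          f"per-stage bandwidth = {100.0 * relative_bandwidth(q):.1f} % of f0")
```

Both configurations reach the same compression range, but each stage of the cascade is about a hundred times wider in bandwidth than the single high-Q stage.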

In effect, the series-connected resonator circuits
are coupled directly to one another in such a manner
that the output voltage of an upstream resonator
circuit is at the same time the input voltage to the
resonator circuit connected downstream from it, and in
such a manner that the output current (which is
generally not zero during operation) from an upstream
resonator circuit is at the same time the input current
for the resonator circuit connected downstream from it.
For this purpose, the circuit arrangement generally has
no intermediate element between upstream and downstream
resonator circuits. This can be achieved by means of a
circuit arrangement in which the second connection of
the coil of an upstream resonator circuit is coupled to
the first connection of the resistance in the resonator
circuit which is connected downstream from the upstream
resonator circuit.

Alternatively, the series-connected resonator circuits
can be free of direct coupling, that is to say they can
be decoupled from one another to a certain extent, in
particular by the interposition of an intermediate
element between the output of an upstream resonator
circuit and the input of a downstream resonator
circuit. This is preferably implemented in such a
manner that the output voltage from an upstream
resonator circuit is at the same time the input voltage
for the resonator circuit connected downstream from it,
and in such a manner that the output current from an
upstream resonator circuit is equal to zero. The input
current to the downstream resonator circuit essentially
results from the impedance of this resonator circuit.
In the case of a circuit arrangement such as this, an
operational amplifier is preferably provided (as an
impedance converter) as the intermediate element
between an upstream resonator circuit and the resonator
circuit connected downstream from it. A first input of
the operational amplifier is coupled to the second
connection of the coil of the upstream resonator
circuit. A second input of the operational amplifier is
fed back to an output of the operational amplifier, and
is coupled to the first connection of the resistance in
the resonator circuit which is connected downstream
from the upstream resonator circuit.

In order to reduce the computation power required, the
Q-factor of all the series-connected resonator circuits
can be set to be identical. In this case, the
computation power which is required by the resonator
control circuit is kept particularly low, since a
common Q-factor is determined and set for all of the
resonator circuits, that is to say all of the filter
parameters are identical. If a circuit arrangement with
a particularly high quality requirement is needed,
then, alternatively, the Q-factor of the various
series-connected resonator circuits can be set to be
different for optimization purposes. In a circuit
arrangement such as this, the Q-factor of each of the
series-connected resonator circuits is thus set
individually.

The circuit arrangement preferably has a plurality of
parallel-connected branches, with each branch having
one resonator circuit or a plurality of
series-connected resonator circuits. In this case, the
Q-factor of each resonator circuit can be subjected to
open-loop or closed-loop control by means of the
respective resonator control circuit.

According to this particularly advantageous development
of the invention, a plurality of parallel-connected
branches of resonator circuits are provided, in
which case a plurality of resonator circuits may be
connected in series in each branch.

The at least one resonator circuit in each branch is
preferably designed in such a manner that it passes a
respective frequency range of the input signal in such
a manner that the branches together pass a contiguous
frequency interval. The frequency range for which the
human ear is sensitive is between about 20 Hz and
20 kHz. In order to cover this hearing frequency range,
the frequency ranges of the signals which can be
transmitted in different channels in the parallel
arrangement of resonator circuits are in general
different. The frequency range of signals which can be
transmitted in one resonator circuit is a distribution
curve around the resonant frequency, with a specific
3 dB width. The resonant frequency can be adjusted by
adjustment of the values L, C of the resonator circuit,
and the 3 dB width can be adjusted by adjustment of the
respective Q-factor. If the various frequency
pass-bands of the various branches of resonator
circuits are joined together, then this results in a
preferably contiguous frequency interval, by means of
which the sensitivity range of human hearing, or any
other frequency range of interest, can be covered.

The frequency ranges which the various branches pass
preferably at least partially overlap one another. This
ensures that all of the frequencies are covered, and
that the signal components from the individual branches
can be combined.

The frequency range which a respective branch passes
can preferably be predetermined by adjustment of the
value of the capacitance and/or of the inductance of
the at least one resonator circuit in that branch. This
is based on the fact that the resonant frequency of a
resonator circuit depends on the values of the
inductance and of the capacitance.
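
How a set of branches might be dimensioned to cover the hearing range can be sketched as follows. The sketch uses the standard relation f0 = 1/(2*pi*sqrt(L*C)) for the resonant frequency and approximates the 3 dB band of a branch as f0 plus or minus f0/(2Q), which is reasonable for Q much greater than 1. The number of branches (80), the common Q-factor (10), the fixed capacitance (1 uF) and the logarithmic spacing are assumptions made for illustration, not values from the text.

```python
import math

def resonant_frequency(l_henry, c_farad):
    """f0 = 1 / (2 * pi * sqrt(L * C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(l_henry * c_farad))

def inductance_for(f0_hz, c_farad):
    """Inductance that places the resonance at f0 for a given capacitance."""
    return 1.0 / ((2.0 * math.pi * f0_hz) ** 2 * c_farad)

def log_spaced_centres(f_low, f_high, n):
    """n centre frequencies spaced by a constant ratio from f_low to f_high."""
    ratio = (f_high / f_low) ** (1.0 / (n - 1))
    return [f_low * ratio ** k for k in range(n)]

def passband(f0_hz, q):
    """Approximate 3 dB band edges for Q >> 1: f0 -/+ f0 / (2 * Q)."""
    half = f0_hz / (2.0 * q)
    return f0_hz - half, f0_hz + half

C_FIXED = 1e-6                                   # assumed capacitance per branch
centres = log_spaced_centres(20.0, 20000.0, 80)  # assumed number of branches
inductances = [inductance_for(f, C_FIXED) for f in centres]
bands = [passband(f, 10.0) for f in centres]     # assumed common Q-factor

overlapping = all(bands[k][1] >= bands[k + 1][0] for k in range(len(bands) - 1))
print("adjacent pass-bands overlap:", overlapping)
```

With fewer branches at the same Q-factor, gaps would open between neighbouring pass-bands, so the number of branches and the Q-factor have to be chosen together.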

The circuit arrangement according to the invention is
preferably designed to process an acoustic signal as
the input signal. In this case, the circuit arrangement
according to the invention is suitable for use in a
speech processing system. A system such as this may,
for example, be based on pulsating neural networks
which are instructed to reduce the dynamic range.
Further fields of application are systems for sound
processing and (audio) data compression, when signals
with high amplitudes are intended to be recorded with
as few quantization levels as possible. Furthermore,
there are applications in the medical field, in
particular as a hearing aid for patients with
loud-noise deafness.

The circuit arrangement according to the invention can 
be produced using digital or analog circuit technology. 

At least a part of the circuit arrangement, in
particular the filters and the open-loop or closed-loop
control functionality of the resonator control circuit,
may be in the form of a computer program. The invention
can be implemented by means of a computer program, that
is to say in software, or by means of one or more
specific electrical circuits, that is to say in
hardware, or in any desired hybrid form, that is to say
by means of software components and hardware
components.

A software implementation, in particular of the control
circuit, can be produced, for example, using "C++". An
implementation may be based on any desired processor or
DSP (digital signal processor), or else on an FPGA
module. An FPGA ("Field Programmable Gate Array") is an
integrated programmable circuit which in general has a
large number of programmable cells on one chip.

According to another refinement of the invention,
provision is made for the preferably linear filter bank
to be in the form of a linear wave digital filter.

Furthermore, a plurality of high-pass filters may be
provided, with each filter stage having at least one
associated high-pass filter, and with one high-pass
filter in each case being coupled to the output of a
respective resonator circuit. These high-pass filters
simulate the liquid coupling of the hair bundles of the
sensory cells in the inner ear to the oscillation of the
basilar membrane.

The use of at least one high-pass filter per filter
stage, coupled to the output of the respective
resonator circuit, results in the otherwise relatively
flat high-frequency filter flank of the filter bank
being sharpened. At least some of the high-pass
filters, preferably all of the high-pass filters, are
preferably in the form of first-order high-pass
filters. According to one development of the invention,
the cut-off frequency of at least some of the
first-order high-pass filters is chosen in such a
manner that it corresponds to the frequency of the
maximum sensitivity of a basilar membrane oscillation
of an inner ear of a mammal.

According to another refinement of the invention, a
plurality of rectifier circuits are provided, with one
rectifier circuit in each case being associated with
one of the filter stages and one high-pass filter, and
being coupled to one output of a respective high-pass
filter. A plurality of low-pass filters are preferably
provided as well, with one low-pass filter in each case
being associated with one rectifier circuit and being
coupled to one output of a respective rectifier
circuit. These refinements result in a very good
approximation to the formation of the receptor
potential Um in the human hearing system.
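
The chain of high-pass filter, rectifier and low-pass filter behind one resonator output can be sketched in discrete time as follows. This is a simplified illustration, not the circuit of the invention: it uses textbook first-order filters, a hard half-wave rectifier rather than the "soft" rectification with threshold and saturation mentioned further below, and an assumed sample rate and assumed cut-off frequencies.

```python
import math

def high_pass(samples, cutoff_hz, fs_hz):
    """First-order high-pass: y[n] = a * (y[n-1] + x[n] - x[n-1])."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    a = rc / (rc + 1.0 / fs_hz)
    out, y, prev = [], 0.0, 0.0
    for s in samples:
        y = a * (y + s - prev)
        prev = s
        out.append(y)
    return out

def half_wave_rectify(samples):
    """Hard half-wave rectification: negative values are set to zero."""
    return [max(0.0, s) for s in samples]

def low_pass(samples, cutoff_hz, fs_hz):
    """First-order low-pass implemented as exponential smoothing."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / fs_hz)
    out, y = [], 0.0
    for s in samples:
        y += alpha * (s - y)
        out.append(y)
    return out

FS = 48000.0                                   # assumed sample rate in Hz
tone = [math.sin(2.0 * math.pi * 1000.0 * n / FS) for n in range(4800)]

# Assumed cut-off frequencies: 1 kHz high-pass, 50 Hz low-pass.
envelope = low_pass(half_wave_rectify(high_pass(tone, 1000.0, FS)), 50.0, FS)
print(f"smoothed output after 0.1 s: {envelope[-1]:.3f}")
```

The smoothed output follows the slowly varying level of the channel rather than its fine structure, which is the role the low-pass filter plays after rectification.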


Furthermore, a plurality of activation circuits may be
provided, with one activation circuit in each case
being associated with one of the filter stages, and
with each activation circuit being designed to amplify
any rapid change of a signal supplied to the
activation circuit, and to attenuate components
of the signal supplied to the activation circuit
which are essentially constant over time.

Furthermore, each activation circuit preferably has a
vesicle pool circuit with a large number of vesicle
circuits.

The signal-processing apparatus according to the
invention, which has a circuit arrangement according to
the invention, will be described in more detail in the
following text. The refinements of the
signal-processing apparatus also apply to the circuit
arrangement, and vice versa.

In the signal-processing apparatus, the
further-processing unit may be a speech-recognition
device or a hearing aid.

One implementation of the further-processing unit in
the form of a hearing aid is intended in particular for
an application in which dynamic compression is carried
out in order to compensate for disturbances in the
loudness perception of those with impaired hearing. In
those with disturbed hearing, the outer hair cells may
be defective, as a result of which the increase in
sensitivity fails at low sound levels. The hearing
then in effect always operates with the
sensitivity that it has at high sound levels. This
leads to narrowing (recruitment) of the usable range
of sound levels between the hearing threshold (very
quiet) and the discomfort threshold (very loud). In
order to compensate for this phenomenon, the circuit
arrangement for the signal-processing apparatus
according to the invention can be used to carry out
dynamic compression, which in effect compresses
the wide sound level range from the acoustic
environment to the range that can be perceived by the
patient.

The signal-processing apparatus may also form the input
of a speech-recognition system, in particular in a
pulsating neural network architecture.

The signal-processing apparatus may be designed as an
analog or digital filter bank.

The following principles must clearly be maintained 
within the scope of the invention: 


a) the input signal provided for the circuit
   arrangement is subjected to frequency analysis
   with non-linear dynamic compression;

b) a reduction in information is provided by "soft"
   half-wave rectification of the frequency channels
   that are formed, with a threshold value and
   saturation;

c) speech-relevant modulation frequencies are
   emphasized by simulation of the neural adaptation
   in the hearing system of a mammal, in particular
   of a human being.



Exemplary embodiments of the invention will be
explained in more detail in the following text and are
illustrated in the figures, in which:



Figure 1 shows a circuit arrangement according to one
preferred exemplary embodiment of the
invention.



Figure 2 shows a resonator circuit according to one 
exemplary embodiment of the invention. 

Figure 3 shows an implementation of the resonator 
circuit as shown in Figure 2, in the form of 
a wave digital filter. 
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Figures 4 and 5 show diagrams to illustrate the 
functionality of the circuit arrangement 
shown in Figure 1, 

5 Figure 6a shows a circuit arrangement element according 
to another exemplary embodiment of the 
invention. 

Figure 6b shows an implementation of the resonator 
10 circuits illustrated in Figure 6a as wave 

digital filters. 

Figure 7a shows a circuit arrangement element according 
to another exemplary embodiment of the 
15 invention. 



Figure 7b shows an implementation of the resonator 
circuits illustrated in Figure 7a as wave 
digital filters, 

20 

Figure 8 shows a block diagram of a speech-recognition 
system according to one exemplary embodiment 
of the invention. 



Figure 9 shows a circuit diagram of a linear filter 
bank and of a plurality of resonator circuits 
associated with individual filter stages in 
the filter bank according to one exemplary 
embodiment of the invention. 

Figure 10 shows a diagram illustrating an excitation 
pattern for a non-linear basilar membrane 
model for a 1 kHz tone. 



35 Figure 11 shows a circuit diagram of a circuit element 
which is in each case connected in series 
with a respective resonator circuit in order 
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to form the circuit arrangement according to 
one exemplary embodiment of the invention. 



Figure 12 shows an illustration of modeled nerve action
potentials, and

Figure 13 shows a diagram illustrating different
speech-recognition rates for a speech-recognition
system according to one exemplary embodiment of the
invention, compared with a speech-recognition system
according to the prior art.

Figure 8 shows a speech-recognition system 800
according to one exemplary embodiment of the invention.

For the purposes of the speech-recognition system, a
feature-extraction system 801 is used which extracts,
from the analog speech signals applied to it, the
features which are used for the actual automatic
speech-recognition process (illustrated in Figure 8 by
a speech-recognition block 802).

The feature-extraction system 801 has, in particular,
components for initial filtering, a filter bank and
components for non-linear feature extraction.

As can be seen, the feature-extraction system 801
simulates the signal-processing strategy and the
signal-processing structure of the human hearing
system. This is scaled in physical units analogously to
the signal-processing system of the human hearing
system.



The feature-extraction system 801 is supplied with an
input signal 803 in analog form, as a sound pressure
signal (measured in pascals).

A first component 804 of the feature-extraction system
801 forms a model of the auditory canal which,
however, is optional and is omitted in the preferred
implementation.

The signal 805, which is formed by the model of the
auditory canal 804 from the input signal 803, is
supplied to a middle-ear model component 806.

As illustrated in Figure 9, the middle-ear model
component 806 has an ideal spring component 901 and an
ideal damping component 902 connected in parallel
(implemented in the form of an electrical coil, that is
to say an inductance, for the ideal spring and in the
form of a resistance as the damping component). The
middle-ear model component 806 is designed in such a
manner that the speech-relevant region of the spectrum
of the input signal is emphasized, that is to say it is
amplified. By analogy with the human hearing system,
the signal 807 which is produced by the middle-ear
model component 806 is supplied to an inner-ear model
component 808.

The signals 809 which are formed by the inner-ear model
component 808 can optionally be used directly as
extracted features for speech recognition, or can be
supplied to a sensor-cell model component 810, which
will be described in more detail in the following
text. The signals 811 which are produced by the
sensor-cell model component 810 can likewise be used
directly as feature components for automatic speech
recognition purposes, or may be processed further and,
in the course of this further processing, may be
supplied to a synaptic-model component 812, which
simulates the synaptic mechanism of the human hearing
system. The signals 813 which are formed by the
synaptic-model component 812 are likewise used
according to the invention as features for automatic
speech recognition purposes.

In this context, it should be noted that, on the basis
of alternative refinements of the invention, one, two
or all three of the feature signals 809, 811, 813
described above may optionally be used for speech
recognition purposes.

Within the human hearing system, the sound pressure
which arrives at the tympanic membrane (the tympanic
membrane has a surface area of
$A_{ed} = 55 \times 10^{-6}\,\mathrm{m}^2$) is
converted to a mechanical deflection of the middle-ear
bones (quoted in m).

As has been described above, the middle-ear model
component 806 has a low-pass filter, according to this
exemplary embodiment a first-order low-pass filter,
preferably with a low-pass filter cut-off frequency of
1 kHz.

The spring constant that is simulated by means of the
inductance 901 is about 1500 N/m, and the damping
component which is represented by means of the
resistance 902 is designed to be 0.25 Ns/m.
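
The quoted spring constant and damping value can be checked against the 1 kHz cut-off frequency with a short calculation. The Python sketch below assumes that the parallel spring and damper behave as the first-order system d * x' + k * x = F, whose cut-off frequency is k / (2 * pi * d); this reading of the model, and the function name first_order_cutoff_hz, are illustrative assumptions rather than part of the described embodiment.

```python
import math


def first_order_cutoff_hz(spring_constant, damping):
    """Cut-off frequency of the assumed system d*x' + k*x = F.

    The transfer function X/F = 1 / (k + d*s) has its corner at
    omega_c = k / d, i.e. f_c = k / (2*pi*d).
    """
    return spring_constant / (2.0 * math.pi * damping)


k = 1500.0  # spring constant in N/m (value quoted in the text)
d = 0.25    # damping in Ns/m (value quoted in the text)

f_c = first_order_cutoff_hz(k, d)
print(f"cut-off frequency: {f_c:.0f} Hz")  # roughly 955 Hz, i.e. about 1 kHz
```

Under this assumption the two component values reproduce the stated cut-off frequency to within about 5 percent.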

The signal 807 which is produced by the middle-ear
model component 806 is injected into the inner-ear
model component 808, as is illustrated in detail in
Figure 9. The inner-ear model component 808 is in the
form of a filter bank 903, according to this exemplary
embodiment in the form of a linear wave digital filter
model.

The filter bank 903 has a large number of filter stages
904, 905, 906, as well as a terminating resistance 907.



wo 2005/09350 



23/8 PCT/DE2005/000156 



Each filter stage 904, 905, 906 is formed by an
inductance 904a, 905a, 906a, a resistance 904b, 905b,
906b and a capacitance 904c, 905c, 906c, connected in
series.

The speed of basilar membrane oscillation in the human
inner ear in each case corresponds to the current in
one filter stage 904, 905, 906. The deflection of the
basilar membrane can thus be calculated by integration
of the speed. In order to avoid numerical problems
during the integration process, it is, however,
worthwhile calculating the deflection in a different
way:

In the case of a spring, as represented by the
capacitances 904c, 905c, 906c, the instantaneous
deflection x can be calculated as the quotient of the
spring force and the spring constant. This deflection x
forms the input, that is to say the input signal, for
the dynamic-compression stages 101, which according to
this exemplary embodiment of the invention are in each
case two resonator circuits 101 connected in series, as
will be explained in more detail in the following text.

The resonator circuits 101, as can be seen, form
compression stages using second-order resonators, as
will be explained in more detail in the following text.

In one alternative preferred embodiment, four resonator
circuits 101 are provided connected in series for each
filter stage 904, 905, 906.

As will likewise be explained in more detail in the
following text, the Q-factor of the resonator circuits
101 is in each case modulated instantaneously, in a
range from unity up to a maximum Q-factor value Qmax
(which depends on the position in the inner ear), as
a function of the output signal from each filter stage
904, 905, 906.

The design of the resonator circuits 101 will be
explained in more detail in the following text with
reference to Figure 1.

Circuit arrangement 100 contains a large number of
resonator circuits 101, each of which has a capacitance
and an inductance (not shown in Figure 1), as well as
an input to which an input signal can be applied, and
an output at which an output signal can be produced.
Three of the resonator circuits 101 are in each case
connected in series along a respective row of the
arrangement, which is in the form of a matrix, so that
a respective output of an upstream resonator circuit
101 is coupled to a respective input of a resonator
circuit 101 connected downstream from it. The values of
the inductance and of the capacitance in the resonator
circuits 101 in one row are in each case chosen in such
a manner that the respective row can transmit a signal
of a corresponding frequency interval in a region
around the resonant frequency of the resonator circuits
101 for that row. The resonator circuits 101 for
different rows each have different L, C values, so
that, overall, the individual rows or branches of
resonator circuits 101 cover a contiguous frequency
interval, which corresponds to the sensitivity range of
human hearing (approximately 20 Hz to 20 kHz).

A resonator control circuit 111 is coupled to all of
the resonator circuits 101 by means of a communication
link. The Q-factor of each individual one of the
resonator circuits 101 can be adjusted by means of the
control circuit 111 in order to provide open-loop or
closed-loop control for the Q-factor of the resonator
circuits 101, with the control circuit 111 being
designed in such a manner that it adjusts the Q-factor
of the resonator circuits 101 as a function of the
amplitude of an output signal from the last resonator
circuit 101 in a respective row. By way of example,
the Q-factor of the resonator circuits R11, R12, R13 is
adjusted by means of the resonator control circuit 111
on the basis of the amplitude of a signal at the output
of the resonator circuit R13.

Figure 1 also shows a sound source 103 which emits an
acoustic signal as a global input signal 102. This is
applied to the inputs of the resonator circuits 101
(R11, R21, ..., Rk1, ..., Rn1) in the first column of
resonator circuits 101.

The resonator circuit 101 R11 arranged in the first row
and the first column of resonator circuits will be
analyzed in the following text. The global input signal
102 from the sound source 103 is applied to this at one
input. The resonator circuit 101 R11 passes a frequency
component of the global input signal 102 which is
dependent on the values L and C associated with it, and
this frequency component is produced as the first local
output signal 104 at one output of the resonator
circuit R11. Furthermore, the amplitude of the global
input signal 102 is changed on the basis of the
functionality of the resonator circuit 101 R11,
depending on its (present) Q-factor Q. The Q-factor Q
of the resonator circuit 101 R11 is controlled by means
of a resistance (not shown in Figure 1) in the
resonator circuit 101 R11, with the control circuit 111
providing an appropriate control signal for this
controllable resistance, by which means the impedance
is set to a predetermined value. This results in
adjustment of the Q-factor of the resonator circuit
101, so that an input signal is attenuated to a greater
or lesser extent, corresponding to this value of the
Q-factor, in a subsequent processing cycle. Since the
circuit arrangement 100 is designed for dynamic
compression of the global input signal 102, signal
ranges of high amplitude are, as can be seen,
attenuated to a greater extent than signal ranges of
low amplitude.

The first local output signal 104 is provided as the
first local input signal 105 to the resonator circuit
101 R12 connected downstream from the resonator circuit
101 R11. The first local input signal 105 passes through
the resonator circuit 101 R12, with the second local
output signal 106 being produced at one output. The
second local output signal 106 is used as the second
local input signal 107 for the resonator circuit 101 R13
which is connected downstream from the resonator
circuit 101 R12. A third local output signal 108 is
produced at its output. This is combined (added)
with the output signals (which are each related to a
separate frequency interval) from the respective last
resonator circuit 101 (R13, R23, ..., Rk3, ..., Rn3)
arranged in a row to form a global output signal 109.

In each row of resonator circuits (Rk1, Rk2, Rk3), the
Q-factor of all of the resonator circuits 101 in the
row is controlled by means of the resonator control
circuit 111 on the basis of the amplitude of the output
signal at the output of the respective last resonator
circuit (in the k-th row, resonator circuit Rk3).

The assembled global output signal 109 is thus
subjected to dynamic compression with respect to the
global input signal 102.

The resonator circuit 101 from Figure 1 will be
described in the following text with reference to
Figure 2.

An input signal 200 is symbolized as the voltage source
U in Figure 2. Furthermore, an output signal 204 is
symbolized as the voltage Uc. The input signal 200 is
applied between a first connection of a controllable
resistance 203 and a first connection of a capacitance
201. The output signal 204 is produced between the
first connection of the capacitance 201 and a second
connection of the capacitance 201. Furthermore, a
second connection of the controllable resistance 203 is
coupled to a first connection of an inductance 202, and
a second connection of the inductance 202 is coupled to
the second connection of the capacitance 201.

The value of the resistance R 203 can be adjusted by
means of the control circuit 111. The resonator circuit
101 from Figure 2 thus represents a filter whose
attenuation is controllable.

In the circuit arrangement 100 according to the
invention, three (or in general N) resonator circuits
101 are connected in series as filter elements without
any feedback in each row. The time-dependent output
signal Uc(t), where t is the time, from an upstream
filter in each case defines the input signal U 200 for
the filter which is connected downstream from it.

The resistance R 203 can be varied as a non-linear
function of the filter's own output voltage Uc(t)
(closed-loop control), as a function of Uc(t) of the
respective upstream filter (open-loop control), or else
simultaneously for all of the filters as a function of
Uc(t) of the last filter stage in a row.

The following text describes the calculation rule which
is used as the basis, according to the described
exemplary embodiment, for adjustment of the value R of
a respective resistance R 203.

First of all, a Q-factor Q to be set is calculated for
this purpose.

According to the described exemplary embodiment, the
Q-factor Q of the filter is damped according to a
Boltzmann function:

In the equation (1), Q(t) is the relationship between
the Q-factor Q and the time t. Q0 is a predeterminable
maximum Q-factor of the resonator circuit 101 (for
example Q0 = 10). Qmin is a predeterminable minimum
Q-factor of the resonator circuit (for example
Qmin = 1). SAT is a predeterminable saturation
threshold, that is to say a parameter by means of which
the time dependency of the Q-factor can be
adjusted (for example SAT = 1).

The Boltzmann function (1) approximates the sensitivity
curve of the outer hair sensing cells in the inner ear.
If required, the function can be replaced by a
second-order Boltzmann function, which allows even more
accurate matching, with the introduction of a further
parameter. Equation (1) uses a simple first-order
Boltzmann function, since it has only one free
parameter (specifically SAT) and can thus be processed
with little numerical complexity.
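
As an illustration of such a control rule, the following Python sketch uses an assumed first-order Boltzmann-type form, Q = Qmin + (Q0 - Qmin) * 2 / (1 + exp(|Uc| / SAT)). This form was chosen only because it yields Q0 for a vanishing output amplitude and tends to Qmin for large amplitudes, with SAT as its single free parameter; the exact form of equation (1) may differ, and the function name q_factor is hypothetical.

```python
import math


def q_factor(u_c, q0=10.0, q_min=1.0, sat=1.0):
    """Assumed Boltzmann-type Q control (illustrative form only).

    u_c   : instantaneous output voltage of the controlling filter stage
    q0    : maximum Q-factor, reached for u_c -> 0
    q_min : minimum Q-factor, approached for |u_c| -> infinity
    sat   : saturation threshold that sets where the transition occurs
    """
    x = abs(u_c) / sat
    if x > 700.0:
        # exp(x) would overflow a double; the limit value is q_min.
        return q_min
    # 2 / (1 + exp(x)) equals 1 at x = 0 and decays towards 0.
    return q_min + (q0 - q_min) * 2.0 / (1.0 + math.exp(x))


for amplitude in (0.0, 0.5, 1.0, 2.0, 5.0, 50.0):
    print(f"|Uc| = {amplitude:5.1f}  ->  Q = {q_factor(amplitude):.3f}")
```

The default arguments mirror the example values Q0 = 10, Qmin = 1 and SAT = 1 given in the text.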

The value to be set for the non-linear resistance can
be calculated from the Q-factor Q of the filter using:

$$R(t) = \frac{1}{Q(t)} \sqrt{\frac{L}{C}} \qquad (2)$$

The time-dependent value of the resistance R(t) thus
depends on the value of the inductance L and of the
capacitance C, as well as on the time-dependent
Q-factor Q(t).

Equations (1) and (2), as can be seen, form the control
rule for adjustment of the value R of the resistance
203 by means of the control circuit 111.

The filter which is formed by the resonator circuit 101
illustrated in Figure 2 is linear for very low
amplitudes Uc(t) (where $Q \to Q_0$ for $U_C(t) \to 0$).
It is likewise approximately linear for very high
amplitudes Uc(t) ($Q \to Q_{min}$ for
$U_C(t) \to \infty$). The dynamic compression K is
carried out in the region of the saturation threshold
(SAT) and is $K = Q_0 / Q_{min}$. If there are N = 4
series-connected filter stages (although only three
filter stages are provided in Figure 1 by means of
three resonator circuits in one row), and if the values
are Q0 = 10 and Qmin = 1, this allows major compression
to be achieved, by 80 dB ($K_N = (Q_0 / Q_{min})^N$).
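
The 80 dB figure follows directly from the cascade relation: each stage contributes a factor Q0/Qmin, and the N factors multiply. A minimal Python check, using a hypothetical helper name cascade_compression_db and the usual amplitude convention of 20 * log10, is:

```python
import math


def cascade_compression_db(q0, q_min, n_stages):
    """Compression of n_stages cascaded resonators in dB.

    K_N = (q0 / q_min) ** n_stages is an amplitude ratio, so the
    value in dB is 20 * log10(K_N) = 20 * n_stages * log10(q0 / q_min).
    """
    return 20.0 * n_stages * math.log10(q0 / q_min)


print(cascade_compression_db(10.0, 1.0, 4))  # 80.0 (four stages)
print(cascade_compression_db(10.0, 1.0, 3))  # 60.0 (three stages, as drawn in Figure 1)
```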

In order to cover the entire human hearing range, a
filter bank is provided having resonant frequencies in
the range from approximately 20 Hz to approximately
20 kHz, and this is typically achieved by the use of 50
to 100 rows of resonator circuits 101 (that is to say
n = 50 to n = 100). According to the described
exemplary embodiment, the value of the inductance is
set to L = 1 H. The respective value C is then
calculated for each row of resonator circuits 101 on
the basis of the filter frequency f0 that is covered by
this row, from the resonant frequency of the
corresponding LC element:

$$C = \left(4 \pi^2 f_0^2 L\right)^{-1} \qquad (3)$$

It should be noted that the non-linear Q-factor for
each filter frequency f0, that is to say for each row of
resonator circuits 101, is calculated independently.
With reference to Figure 1, this means that each row of
resonator circuits 101 has an associated corresponding
filter frequency f0 for which the value of the Q-factor
Q(t) is calculated.
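
The per-row capacitance calculation can be sketched in a few lines of Python. The sketch assumes logarithmically spaced resonant frequencies between 20 Hz and 20 kHz, which the text does not specify; the helper names row_capacitance, resonant_frequency and log_spaced_frequencies are likewise hypothetical.

```python
import math


def row_capacitance(f0, inductance=1.0):
    """Capacitance of one row, C = 1 / (4 * pi^2 * f0^2 * L)."""
    return 1.0 / (4.0 * math.pi ** 2 * f0 ** 2 * inductance)


def resonant_frequency(inductance, capacitance):
    """Resonant frequency of an LC element, f0 = 1 / (2 * pi * sqrt(L * C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance * capacitance))


def log_spaced_frequencies(f_low=20.0, f_high=20000.0, n_rows=50):
    """Assumed spacing: equal frequency ratios between neighbouring rows."""
    ratio = (f_high / f_low) ** (1.0 / (n_rows - 1))
    return [f_low * ratio ** k for k in range(n_rows)]


L_HENRY = 1.0  # inductance value used in the described embodiment
freqs = log_spaced_frequencies()
caps = [row_capacitance(f, L_HENRY) for f in freqs]

for f, c in list(zip(freqs, caps))[::10]:
    print(f"f0 = {f:9.1f} Hz   C = {c:.3e} F")
```

Each row then resonates at its own f0 while sharing the same inductance, so only C (and the controlled resistance) differs from row to row.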

A wave digital filter 300, as one implementation of the
resonator stage 101 shown in Figure 2, will be
described in the following text with reference to
Figure 3.

Wave digital filters represent one class of digital
filters with particularly advantageous characteristics.
They are modeled on traditional filters built from
conventional components from information technology,
and are operated with the aid of modern integrated
digital circuits. On the basis of the technology of a
wave digital filter, an analog model can, as can be
seen, be implemented digitally (for example using a
computer).

The components of the wave digital filter 300 in Figure
3 are associated in the following text with the
components of the resonator circuit 101 from Figure 2,
and the corresponding variables are defined.

A first block 301 of the wave digital filter 300
contains a reflection-free serial coupler with the
impedances R11 and R13. R11, as can be seen, represents
the controllable resistance R 203, with respect to a
reference impedance. R12 represents a corrected
impedance of the coil L 202 with respect to
a base frequency. A second block 302 contains a
parallel coupler, which reflects the parallel
connection of the capacitance 201, with the admittances
G21, G22, G23 being illustrated in the second block.
G21 is an input admittance of the second block 302
(G21 = 1/R13), and G23 is an output admittance of the
second block 302. The admittance G22 is used to model
the impedance of the capacitance C 201. A third block
303 represents a store or a filter register for the
capacitance 201, and a fourth block 304 represents a
store or a filter register for the coil 202.

The variables illustrated in Figure 3 will be defined
in the following text. The parameters for a wave
digital filter, for each filter frequency, are:

$$R_{11} = R / R_B \qquad (4)$$

$$R_{12} = \frac{2 \pi F_B L}{R_B \tan(\pi F_B / f_s)} \qquad (5)$$

$$R_{13} = R_{11} + R_{12} \qquad (6)$$

$$G_{21} = R_{13}^{-1} \qquad (7)$$

$$G_{22} = \frac{2 \pi F_B C R_B}{\tan(\pi F_B / f_s)} \qquad (8)$$

$$G_{23} = G_{21} + G_{22} \qquad (9)$$

In this case, R is the resistance 203 and R_B is a
predeterminable reference impedance. F_B is a
predeterminable reference frequency. The values R_B and
F_B are used for scaling. Since the implementation
based on the described exemplary embodiment is carried
out using double-precision float variables, this
normalization is not relevant, although it is if
integer arithmetic is used. L is the inductance of the
coil 202. The value f_s is the sampling frequency of the
sampled time signal. The variables R11, R12, R13 are
resistances, while, in contrast, the variables G21,
G22 and G23 are admittances, that is to say inverse
resistances.

The filter coefficients g1, g2 are:

$$g_1 = R_{11} / R_{13} \qquad (10)$$

$$g_2 = G_{21} / G_{23} \qquad (11)$$

The filter registers Z1 (fourth block 304) and Z2
(third block 303) are initialized to zero.

The signals at the individual ports can be calculated
successively. For the "forward wave" of the signal,
that is to say, as can be seen, the coefficients on the
arrows which are oriented to the right in Figure 3,
the following apply:

$$b_{13} = -(U + Z_1) \qquad (12)$$

$$b_{20} = -g_2 (Z_2 - b_{13}) \qquad (13)$$

$$b_{23} = b_{20} + Z_2 \qquad (14)$$

The variable U in equation (12) is the input signal
200.

For the "backward wave", that is to say, as can be
seen, the arrows which are oriented to the left in
Figure 3, the coefficients are:

$$b_{22} = b_{20} + b_{23} \qquad (15)$$

$$b_{21} = b_{22} + Z_2 - b_{13} \qquad (16)$$

$$a_0 = b_{21} - b_{13} \qquad (17)$$

$$b_{11} = U - g_1 a_0 \qquad (18)$$

$$b_{12} = -(b_{11} + b_{21}) \qquad (19)$$

The output signal Uc 204 is then calculated to be:

$$U_C = (b_{22} + Z_2) / 2 \qquad (20)$$

The filter registers (blocks 303, 304) are updated as
follows:

$$Z_1 = -b_{12} \qquad (21)$$

$$Z_2 = b_{22} \qquad (22)$$
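
The update rules (4) to (22) translate almost line by line into code. The Python sketch below implements one resonator stage with a fixed resistance, that is to say without the Q-factor control loop, and measures its gain at the resonant frequency. Several choices are assumptions made for illustration only: the reference frequency F_B is set equal to f0, R_B is set to 1, the resistance is derived from the standard series-resonator relation Q = sqrt(L/C) / R, and all function names are hypothetical.

```python
import math


def wdf_coefficients(R, L, C, f_s, F_B, R_B=1.0):
    """Port values (4) to (9) and filter coefficients (10), (11)."""
    t = math.tan(math.pi * F_B / f_s)
    R11 = R / R_B
    R12 = 2.0 * math.pi * F_B * L / (R_B * t)
    R13 = R11 + R12
    G21 = 1.0 / R13
    G22 = 2.0 * math.pi * F_B * C * R_B / t
    G23 = G21 + G22
    return R11 / R13, G21 / G23  # g1, g2


def wdf_step(U, Z1, Z2, g1, g2):
    """One time slice, equations (12) to (22). Returns (Uc, Z1, Z2)."""
    b13 = -(U + Z1)             # (12) forward wave
    b20 = -g2 * (Z2 - b13)      # (13)
    b23 = b20 + Z2              # (14)
    b22 = b20 + b23             # (15) backward wave
    b21 = b22 + Z2 - b13        # (16)
    a0 = b21 - b13              # (17)
    b11 = U - g1 * a0           # (18)
    b12 = -(b11 + b21)          # (19)
    Uc = (b22 + Z2) / 2.0       # (20) output across the capacitance
    return Uc, -b12, b22        # (21), (22) register updates


def resonance_gain(Q=10.0, f0=1000.0, f_s=48000.0, L=1.0, seconds=0.1):
    """Steady-state output amplitude for a unit sine at f0 (fixed Q)."""
    C = 1.0 / (4.0 * math.pi ** 2 * f0 ** 2 * L)  # equation (3)
    R = math.sqrt(L / C) / Q                       # assumed series-RLC relation
    g1, g2 = wdf_coefficients(R, L, C, f_s, F_B=f0)
    Z1 = Z2 = 0.0                                  # registers start at zero
    n = int(seconds * f_s)
    tail = 5 * int(f_s / f0)                       # last five periods
    peak = 0.0
    for k in range(n):
        U = math.sin(2.0 * math.pi * f0 * k / f_s)
        Uc, Z1, Z2 = wdf_step(U, Z1, Z2, g1, g2)
        if k >= n - tail:
            peak = max(peak, abs(Uc))
    return peak


gain = resonance_gain()
print(f"gain at f0: {gain:.2f}")  # close to Q = 10, i.e. about 20 dB per stage
```

To add the compression, the resistance R and the coefficients g1, g2 would be recomputed from the current Q-factor before each call to wdf_step. With the sign conventions of equations (12) to (22), a constant input settles to an output of equal magnitude and opposite sign, so only magnitudes are compared here.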

The output signal Uc 204 is passed as the input signal
U 200 to the filter stage 101 downstream from the
filter stage 101 under consideration. The Q-factor to
be set for the series-connected filters 101 is
determined again using equation (1) on the basis of the
output signal Uc 204 from the last filter stage 101 in
one row of filter stages 101. The value of the
resistance R which governs the attenuation is
calculated in accordance with equation (2) from the
value of the Q-factor Q determined in this way. The
filter impedances (R11, R12, R13, G21, G22, G23) and
filter coefficients (g1, g2) are calculated again using
equations (4) to (11) with the amended value of the
resistance R 203. After this step, the output signal is
calculated for the next time slice. In other words, the
time profile of the signal is broken down into a
plurality of time slices, which are numerically
calculated successively.

A diagram 400 will be explained in the following text
with reference to Figure 4, in which the functionality
of the circuit arrangement according to the invention
is shown on the basis of one preferred exemplary
embodiment of the invention. The diagram 400 relates to
a circuit arrangement having N = 4 series-connected
resonator circuits. The maximum Q-value is assumed to
be Q0 = 10, and the minimum Q-value is assumed to be
Qmin = 1.

The frequency of a signal, normalized with respect to a
reference frequency f0, is plotted on a logarithmic
scale along the abscissa 401 of the diagram 400. The
reaction of the system to an input signal of a specific
intensity is shown on a logarithmic scale along the
ordinate 402. First to eighth curves 403 to 410
represent the frequency response (that is to say in
this case the respective value of the maximum amplitude
of the filter output) of the circuit arrangement
according to the invention for different signal
amplitudes (with respect to a reference amplitude). The
first curve 403 corresponds to an amplitude of
$1 \times 10^{-5}$, the second curve 404 corresponds to
an amplitude of $1 \times 10^{-4}$, the third curve 405
corresponds to an amplitude of $1 \times 10^{-3}$, the
fourth curve 406 corresponds to an amplitude of
$1 \times 10^{-2}$, the fifth curve 407 corresponds to
an amplitude of $1 \times 10^{-1}$, the sixth curve 408
corresponds to an amplitude of $1 \times 10^{0}$, the
seventh curve 409 corresponds to an amplitude of
$1 \times 10^{1}$, and the eighth curve 410 corresponds
to an amplitude of $1 \times 10^{2}$.

Furthermore, the input signal is assumed to be a
sinusoidal oscillation, which is windowed using a
$\cos^2$ window. Curves 403 to 410 are obtained for an
entire filter bank comprising N = 4 series-connected
resonator circuits without feedback.

First of all, the diagram 400 shows that the
attenuation of the input signal becomes greater the
higher the signal intensity or signal amplitude. If the
amplitudes are very low, the filters are linear and the
resonant peak is approximately 80 dB. The response of
the filter bank decreases very sharply towards high
frequencies, because the filters are in the form of
low-pass filters (see Figure 2). The high-frequency
response of the filters falls at approximately 6 dB per
octave, because the filter parameters are scaled by f0.
The curves in Figure 4 simulate the highly asymmetric
frequency selectivity of human hearing, to a good
approximation.

The relationship between the amplitudes of the input
signal and of the output signal of a circuit
arrangement according to the invention will be
described in the following text with reference to
Figure 5.

The sound pressure level Ain is plotted on a logarithmic
scale in dB along the abscissa 501 of the diagram 500,
with respect to a reference sound pressure of 20 µPa.
The strength of the output signal Aout is plotted
in dB in arbitrary units along the ordinate 502. Curves
503 to 507 show (for different scenarios) the growth
function of a filter cascade comprising four resonator
circuits (N = 4 filters connected in series) at the
resonant frequency f0. The minimum Q-factor is assumed
to be Qmin = 1.

The first curve 503 shows a linear growth function. The
second curve 504 shows a growth function of the inner
ear, that is to say the speed of the basilar membrane
with respect to the sound pressure measured in front of
the tympanic membrane. The data for the second curve
504 is taken from [2]. A third curve 505 shows the
profile for a Q-factor of Q = 2, a fourth curve
506 shows the profile for Q = 4, and a fifth curve 507
shows the profile for Q = 10.

As can be seen, Figure 5 shows the growth function of a
filter output for f = f0 with the filter variable Q as a
parameter. The growth functions are approximately
linear at very low and very high amplitudes. The wide
compression range (in particular for a high Q) is
conspicuous, extending over more than four decades. The
wide dynamic range of the input signal (100 dB) is
compressed to 40 dB (for Q = 10). Quiet signals are
"amplified" on a frequency-specific basis because of
the resonant peak. The growth function provides a very
good simulation of the oscillation responses measured
on a living hearing system (see the curve 504). The
circuit arrangement according to the invention
therefore provides an approximate technical model of
the non-linear preprocessing in the inner ear.

A circuit arrangement 600 according to another
preferred exemplary embodiment of the invention will be
described in the following text with reference to
Figure 6a.

The circuit arrangement 600 is formed from a first
resonator circuit 601 and a second resonator circuit
602, each of which is designed in the same way as the
resonator circuit 101 illustrated in Figure 2. The
second resonator circuit 602 is connected downstream
from the first resonator circuit 601.

As can be seen, the circuit arrangement 600 can be
regarded as a directly coupled implementation of two
(N = 2) series-connected resonator circuits 601, 602.

As is shown in Figure 6a, the second connection of the
coil 202 of the upstream resonator circuit 601 is
coupled to the first connection of the resistance 203
in the downstream second resonator circuit 602.

According to the exemplary embodiment of resonator
circuits 601, 602, which are coupled directly to one
another, as shown in Figure 6a, the output voltage Uc1
of the upstream resonator circuit 601 is equal to the
input voltage to the following resonator circuit 602.
Furthermore, the output current from the first
resonator circuit 601 is equal to the input current to
the second resonator circuit 602.

It should be noted that the values of the impedances R1
and R2, of the inductances L1 and L2 and of the
capacitances C1 and C2 of the resonator circuits 601,
602 may differ from one another, and/or may be
set/controlled differently.

One implementation of the resonator circuits 601, 602
illustrated in Figure 6a will be described as a wave
digital filter 650 in the following text with reference
to Figure 6b.

As can be seen, the wave digital filter 650 is formed
from a first component 651, which represents the first
resonator circuit 601, and from a second component 652,
which represents the second resonator circuit 602. The
two components 651, 652 are directly coupled to one
another in the manner illustrated in Figure 6b,
corresponding to the coupled configuration of the
resonator circuits 601, 602 shown in Figure 6a. The
internal design of each of the components 651, 652
corresponds essentially to that of the wave digital
filter 300 shown in Figure 3.

A circuit arrangement 700 according to yet another
exemplary embodiment of the invention will be described
in the following text with reference to Figure 7a.

The circuit arrangement 700 is formed from a first
resonator circuit 701 and a second resonator circuit
702, which are connected in series. As can be seen, the
resonator circuits 701, 702 are connected in series in
a configuration such that they are decoupled from one
another, that is to say an intermediate element is
connected between the resonator circuits 701 and 702.

Each of the resonator circuits 701, 702 is designed
essentially in the same way as the resonator circuit
101 illustrated in Figure 2. Furthermore, an
operational amplifier 703 is connected between the
first resonator circuit 701 and the second resonator
circuit 702, with a non-inverting input 703a of the
operational amplifier 703 being coupled to the second
connection of the coil 202 in the upstream first
resonator circuit 701. Furthermore, an inverting input
703b of the operational amplifier 703 is fed back to
its output 703c, and is coupled to the first connection
of the resistance 203 in the second resonator circuit
702, which is connected downstream from the first
resonator circuit 701.

On the basis of this configuration, the output voltage
UC1 204 from the upstream first resonator circuit 701
is equal to the input voltage of the downstream second
resonator circuit 702. The output current from each
resonator circuit is equal to zero. The input current
to the downstream second resonator circuit 702 is
governed by the impedance of that second resonator
circuit 702. As is shown in Figure 7A, these conditions
can be implemented using analog technology by means of
an impedance converter, which applies the output
voltage from the upstream resonator circuit 701 to the
input of the downstream resonator circuit 702.
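
The effect of the decoupling can be illustrated numerically. The following is a minimal sketch, not the circuit of the invention: it assumes that each resonator circuit behaves as a series R-L-C stage whose output is taken across the capacitor, and that the impedance converter is an ideal unity-gain buffer. All component values and function names are hypothetical. Under these assumptions no stage loads the preceding one, so the overall frequency response is simply the product of the individual stage responses.

```python
import numpy as np

def resonator_response(freq_hz, R, L, C):
    """U_C / U_in of an (assumed) series R-L-C stage with output across C."""
    s = 2j * np.pi * np.asarray(freq_hz, dtype=float)
    return 1.0 / (1.0 + s * R * C + s**2 * L * C)

def buffered_cascade_response(freq_hz, stages):
    """Response of stages separated by ideal unity-gain buffers.

    Because each buffer draws no current from the upstream stage,
    the stage responses multiply.
    """
    H = np.ones_like(np.asarray(freq_hz, dtype=float), dtype=complex)
    for R, L, C in stages:
        H = H * resonator_response(freq_hz, R, L, C)
    return H

# Hypothetical component values (ohm, henry, farad), for illustration only.
stages = [(100.0, 10e-3, 1e-6), (100.0, 10e-3, 2e-6)]
freqs = np.array([0.0, 500.0, 1000.0, 2000.0])
H_total = buffered_cascade_response(freqs, stages)
print(np.abs(H_total))
```

In a directly coupled configuration, by contrast, the downstream stage would load the upstream one and the responses would not simply multiply.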

A wave digital filter 750 will be described, as an
implementation of the circuit arrangement 700 shown in
Figure 7A, in the following text with reference to
Figure 7B.

The wave digital filter 750 is subdivided into a first
component 751 and a second component 752, with the
first component 751 representing the first resonator
circuit 701, and with the second component 752
representing the second resonator circuit 702. As can
be seen, the two components 751, 752 are coupled to one
another by virtue of the functionality of the
operational amplifier 703. The internal design of each
of the components 751, 752 corresponds essentially to
the configuration shown in Figure 3. The input signal
to the first component 751 is U, and the input signal
to the second component 752 is UC1.

The combination of the linear filter bank 808 with the
non-linear compression stages 101 simulates, according
to the invention, the non-linear oscillation response
of the inner ear of mammals very well, as has been
explained above in conjunction with Figure 5.

In particular, a high degree of dynamic compression is
achieved: sound levels in the range from 0 dB SPL to
120 dB SPL are mapped to a deflection range from 1 nm
to 100 nm (this corresponds approximately to 40 dB)
(see Figure 10), and this is of major importance for
the further processing carried out in the course of
feature extraction and speech recognition.
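
The 40 dB figure can be checked with a short calculation. The sketch below assumes that the deflection range is expressed as an amplitude ratio (20 log10); the variable and function names are illustrative only.

```python
import math

def amplitude_ratio_db(upper, lower):
    """Ratio of two amplitudes expressed in decibels (20 * log10)."""
    return 20.0 * math.log10(upper / lower)

input_range_db = 120.0 - 0.0                       # 0 dB SPL to 120 dB SPL
output_range_db = amplitude_ratio_db(100e-9, 1e-9)  # 1 nm to 100 nm
average_compression = input_range_db / output_range_db

print(round(output_range_db, 6), round(average_compression, 6))
```

On this reading, an input range of 120 dB is mapped to roughly 40 dB of deflection, that is to say an average compression of about 3:1 in decibel terms.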


The diagram 1000 in Figure 10 illustrates the cochlea
position along the abscissa 1001, and the deflection of
the basilar membrane which occurs at each respective
cochlea position, along the ordinate 1002. As can be
seen, the diagram 1000 thus represents an excitation
pattern (RMS value) of the non-linear basilar membrane
model for a 1 kHz tone. The illustrated curves 1003,
1004, 1005, 1006, 1007, 1008, 1009 are very highly
compressed at the point 1010 with the greatest
sensitivity (illustrated as a dashed line in Figure 10),
normally at the position of 21 mm with respect to the
null position of the cochlea. Furthermore, Figure 10
shows an excitation threshold 1011, above which the
human hearing system perceives a signal deflection.

The basilar membrane signal xBMi which is produced by
the respective resonator circuits 101 at the end of
each series circuit is supplied to a respective
filter-output circuit 1100, as is illustrated in
Figure 11.

Each filter-output circuit 1100 has a high-pass filter
1101, a rectifier circuit 1102 coupled downstream from
it on the output side, a low-pass filter 1103 coupled
downstream from it in the signal flow direction, an
activation circuit 1104 coupled downstream from this in
the signal flow direction, as well as a vesicle pool
circuit 1105 and a neurotransmitter circuit 1106.

The respective basilar membrane signal xBM1, xBM2, xBMn
is high-pass filtered and scaled by means of the
high-pass filter 1101, which has a capacitance 1107 and
a resistance 1108, so that only the respective
speech-relevant dynamic range is extracted by means of
a second-order Boltzmann function. The still relatively
flat slope of the filter curves of the inner ear, as is
illustrated in Figure 10, is sharpened somewhat by
means of the first-order high-pass filter 1101.

According to this exemplary embodiment of the
invention, the cut-off frequency of the high-pass
filter 1101 corresponds to the frequency of maximum
sensitivity of the basilar membrane oscillation.
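
The behavior of a first-order high-pass filter formed from a capacitance and a resistance can be sketched in discrete time as follows. This is an illustrative approximation only, not the filter of the exemplary embodiment: the sample rate, the 1 kHz cut-off frequency (chosen here merely by analogy with the 1 kHz tone of Figure 10) and the function name are assumptions.

```python
import math

def rc_highpass(samples, sample_rate_hz, cutoff_hz):
    """First-order RC high-pass, discretized with a backward difference.

    The product R*C follows from the cut-off frequency f_c = 1 / (2*pi*R*C).
    """
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = rc / (rc + dt)
    out = []
    prev_x = prev_y = 0.0
    for i, x in enumerate(samples):
        # The first sample is passed through; afterwards only changes
        # in the input are propagated, and they decay with R*C.
        y = x if i == 0 else alpha * (prev_y + x - prev_x)
        out.append(y)
        prev_x, prev_y = x, y
    return out

# A constant (stationary) input is suppressed by the high-pass filter.
step = [1.0] * 2000
filtered = rc_highpass(step, sample_rate_hz=16000.0, cutoff_hz=1000.0)
print(filtered[0], filtered[-1])
```

Components well below the cut-off frequency are attenuated at 20 dB per decade, which is what steepens the low-frequency side of each channel's response.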

The asymmetry of the Boltzmann function that is used
according to the invention results in rectification of
the signal (produced according to this exemplary
embodiment by means of the rectifier circuit 1102).
The rectified signal is low-pass-filtered in the next
stage by means of the low-pass filter 1103, so that a
receptor-potential signal Um is produced at the output
of the low-pass filter 1103.

According to this exemplary embodiment of the
invention, the resistance 1109 in the low-pass filter
1103 has an admittance of gn = 60 nS, by analogy with
the cell membrane, and the capacitance 1110 of the
low-pass filter 1103 has a value of Cm = 12 pF,
simulating the cell membrane.
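
The time constant and cut-off frequency implied by these two values can be worked out directly. The sketch below assumes a simple first-order low-pass in which the 60 nS conductance and the 12 pF capacitance form a parallel membrane model; the variable names are illustrative.

```python
import math

conductance_s = 60e-9    # 60 nS, as stated for the resistance 1109
capacitance_f = 12e-12   # 12 pF, as stated for the capacitance 1110

# Time constant of a first-order RC low-pass: tau = C / g = R * C.
tau_s = capacitance_f / conductance_s

# Corresponding -3 dB cut-off frequency: f_c = 1 / (2 * pi * tau).
cutoff_hz = 1.0 / (2.0 * math.pi * tau_s)

print(f"tau = {tau_s * 1e3:.2f} ms, cut-off = {cutoff_hz:.1f} Hz")
```

Under this assumption the time constant is 0.2 ms and the cut-off frequency is just under 800 Hz.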

The activation of the respective cell, simulated
according to the invention by the vesicle pool circuit
1105 and the neurotransmitter circuit 1106, is
calculated from the receptor potential Um by means of a
Boltzmann function.
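
For illustration, a first-order Boltzmann function can be sketched as follows. The half-activation potential and the slope factor are hypothetical values chosen only to show the sigmoidal, saturating shape; they are not taken from the exemplary embodiment, and the order of the function actually used may differ.

```python
import math

def boltzmann(u, u_half, slope):
    """First-order Boltzmann function mapping a potential to the range (0, 1)."""
    return 1.0 / (1.0 + math.exp((u_half - u) / slope))

U_HALF_V = -0.040   # hypothetical half-activation potential in volts
SLOPE_V = 0.005     # hypothetical slope factor in volts

# Well below, at, and well above the half-activation potential.
activation = [boltzmann(u, U_HALF_V, SLOPE_V) for u in (-0.070, -0.040, -0.010)]
print(activation)
```

The function is insensitive to very small potentials and saturates for large ones, so only an intermediate range of the receptor potential changes the activation appreciably.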


The low-pass filtering and the rectification, as
described above, have a number of effects:

a) at low signal frequencies, there is one and only
one maximum excitation of the sensor cells per
cycle of the acoustic stimulus,

b) acoustic signals in the frequency range above the
cut-off frequency of the inner hair cells lead to
activation on the basis of their envelope curve,
and

c) the sensitivity and saturation of the Boltzmann
function result in focusing of the sound
processing on speech-relevant information.

In other words, the further processing of the
receptor-potential signal Um is carried out by the
vesicle pool circuit 1105 and the neurotransmitter
circuit 1106, which are designed such that changes over
time in the sound signal (that is to say in the input
signal) are emphasized, and signal components of the
input signal which are essentially constant over time
are ignored (adapted).

This results in effective suppression of stationary
signals (for example interference noise).

According to this exemplary embodiment of the
invention, the adaptation is modeled by means of the
vesicle pool circuit 1105, with the simulated vesicle
pool being filled continuously (but slowly) to its
nominal value. A neurotransmitter current (according to
this exemplary embodiment of the invention at a rate of
28000/s) is produced by the vesicle pool circuit 1105
in proportion to the instantaneous vesicle pool size
and to a probability which is derived using a Boltzmann
function from the membrane potential of the inner hair
cell.

In the case of large-amplitude sound signals, the
majority of the vesicle pool is depleted, so that
subsequent signal components generate only a small
signal, that is to say a signal with a small amplitude.

In phases during which the input signal has a small
amplitude, the vesicle pool is regenerated again. In
other words, this means that the vesicle pool circuit
1105 simulates the functionality described above, and
is designed in such a way that two time constants are
provided, specifically a first time constant of
τ1 = 140 ms and a second time constant of τ2 = 3 ms.

The neurotransmitter current flows into the "synaptic
gap", where it is dissipated with a time constant
τ3 = 1 ms; according to the invention, this is
simulated by the neurotransmitter circuit 1106.
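
The adaptation behavior described above can be sketched with a simple continuous model. This is only an illustration under stated assumptions, not the circuit of the exemplary embodiment: the pool is normalized to a nominal value of 1, the 140 ms time constant is assumed to govern the refilling, the 3 ms time constant is not modeled, the release probability is supplied directly instead of being derived from the membrane potential, and forward-Euler integration is used. All function and variable names are hypothetical.

```python
def simulate_adaptation(release_prob, dt=1e-5, rate_per_s=28000.0,
                        tau_refill_s=0.140, tau_cleft_s=0.001):
    """Euler simulation of a normalized vesicle pool and the synaptic gap."""
    pool = 1.0     # pool size relative to its nominal value
    cleft = 0.0    # transmitter content of the synaptic gap
    pool_trace, release_trace, cleft_trace = [], [], []
    for p in release_prob:
        # Release is proportional to the pool size and the probability.
        release = rate_per_s * p * pool
        release_trace.append(release)
        # The pool refills slowly toward 1 and is drained by the release.
        pool += dt * ((1.0 - pool) / tau_refill_s - release)
        # The gap is fed by the release and decays with its time constant.
        cleft += dt * (release - cleft / tau_cleft_s)
        pool_trace.append(pool)
        cleft_trace.append(cleft)
    return pool_trace, release_trace, cleft_trace

# 0.1 s of silence, 0.1 s of constant stimulation, 0.5 s of silence.
stimulus = [0.0] * 10000 + [0.01] * 10000 + [0.0] * 50000
pool_trace, release_trace, cleft_trace = simulate_adaptation(stimulus)
print(release_trace[10000], release_trace[19999], pool_trace[-1])
```

In this sketch the release is large at stimulus onset and falls to a small fraction of that value while the stimulus remains constant, and the pool recovers during the subsequent quiet phase, which is the qualitative behavior attributed to the vesicle pool circuit.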

In addition to the release from the vesicle pool, a
further neurotransmitter current is produced, which
depends only on the membrane potential of the inner
hair cell; for this current, the chosen model is based
on an infinite vesicle pool size and a rate of 9000/s.

The two neurotransmitter currents allow adequate coding
of stationary and transient sound signals, that is to
say adequate coding into nerve-action potentials.


The vesicle pool simulated by the vesicle pool circuit
1105 can be modeled not only continuously but also as
comprising discrete vesicles. In the case of discrete
modeling, the neurotransmitter current is in the form
of a stochastic process. This procedure is chosen in
order to code the switching signal into discrete
nerve-action potentials.

A nerve-action potential is illustrated in a
nerve-action potential diagram 1200 shown in Figure 12,
and is triggered when the concentration of the
respective neurotransmitter in the synaptic gap exceeds
a predetermined threshold value, which is 1.0 vesicles
according to this exemplary embodiment.
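
The threshold criterion can be sketched as follows. The sketch assumes that a nerve-action potential is registered at each upward crossing of the threshold and that the concentration must fall back to or below the threshold before a further potential can be triggered; this re-arming rule, the absence of a refractory period and the function name are assumptions made for illustration.

```python
def detect_action_potentials(concentration, threshold=1.0):
    """Return the sample indices at which the threshold is crossed upward."""
    spikes = []
    above = False
    for i, c in enumerate(concentration):
        if c > threshold and not above:
            spikes.append(i)
        above = c > threshold
    return spikes

# Example trace of the transmitter concentration in the synaptic gap,
# in units of vesicles.
trace = [0.2, 1.2, 1.5, 0.4, 1.1]
spike_indices = detect_action_potentials(trace)
print(spike_indices)
```

In this example two action potentials are registered, at the second and the fifth sample; the third sample does not trigger again because the concentration has not yet dropped below the threshold.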

Figure 12 shows the modeled nerve-action potentials
that are produced for stimulation with a synthetic
vowel "e".

The two formant frequencies of the vowel "e" each
result in excitations. Furthermore, the time structure
of these excitations (in particular for the second
formant) is modulated with the fundamental speech
frequency (100 Hz, corresponding to a period of 10 ms).

One highly advantageous characteristic of the
feature-extraction unit 801 according to the invention
is that it can be evaluated and optimized on the basis
of the achievable recognition performance in the course
of a speech-recognition method in an automatic
speech-recognition system.

The diagram 1300 in Figure 13 compares the
speech-recognition performance of a conventional
speech-recognition method based on Fast Fourier
Transformation (word error rate curve 1301) with that
of different processing stages according to the
invention (inner-ear model component only: word error
rate curve 1302; sensor cells: word error rate curve
1303) as a function of the interference noise (plotted
along the abscissa).

The respectively achieved word error rate is 
illustrated along the ordinate 1305 in Figure 13. 


As is shown in Figure 13, without the existence of
interference noise, the recognition performance of the
conventional method based on Fast Fourier
Transformation is better, due in particular to the
maturity of the algorithms, which have been developed
over several years, but the robustness of the features
provided according to the invention becomes evident as
the interference noise rises.